Planning with Partially Specified Behaviors

نویسندگان

  • Javier Segovia Aguas
  • Jonathan Ferrer-Mestres
  • Anders Jonsson
چکیده

In this paper we present a framework called PPSB for combining reinforcement learning and planning to solve sequential decision problems. Our aim is to show that reinforcement learning and planning complement each other well, in that each can take advantage of the strengths of the other. PPSB uses partial action specifications to decompose sequential decision problems into tasks that serve as an interface between reinforcement learning and planning. On the bottom level, we use reinforcement learning to compute policies for achieving each individual task. On the top level, we use planning to produce a sequence of tasks that achieves an overall goal. Experiments show that our framework is competitive with realistic environments where a robot has to perform some tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Synthesis of Situation Control Rules under Exogeneous Events

One approach for computing plans for reactive agents is is to check goal statements over state trajectories modeling predicted behaviors of an agent. This paper describes a powerful extension of this approach to handle time, safety, and liveness goals that are specified by Metric Temporal Logic formulas. Our planning method is based on an incremental planning algorithm that generates a reactive...

متن کامل

Self-Efficacy and Planning Predict Dietary Behaviors in Costa Rican and South Korean Women: Two Moderated Mediation Analyses

Dietary planning is supposed to mediate between intentions and dietary behaviors. However, if a person lacks self-efficacy, this mediation might fail. A cross-sectional study in Costa Rica and a longitudinal study in South Korea were designed to examine the moderating role of self-efficacy in the intention– planning–behavior relationship. Intentions, planning, self-efficacy, dietary behaviors, ...

متن کامل

Intentions, planning, and self-efficacy predict physical activity in Chinese and Polish adolescents: Two moderated mediation analyses1

Planning is assumed to translate intentions into health behaviors. However, this may fail due to a lack of perceived self-efficacy. People do not tackle challenging tasks if they harbor self-doubts, even if they have made a good action plan. The present two descriptive longitudinal studies are designed to examine the putative moderating role of self-efficacy in the planning-behavior relationshi...

متن کامل

Decomposition and Causality in Partial-order Planning

We describe DPOCL, a partinl-order csnsal llnk planner that includes action decomposition. DPOCL builds directly on the SNLP algorithm (McAllester Rosenbiltt 1991), and hence is clear and simple, ud can readily be integrated with other SNLP extensions. In addition, DPOCL is specifically designed to handle partially specified action decompositions. Plan generation in DPOCL exploits the planner’s...

متن کامل

Automated Hierarchy Discovery for Planning in Partially Observable Environments

Planning in partially observable domains is a notoriously difficult problem. However, in many real-world scenarios, planning can be simplified by decomposing the task into a hierarchy of smaller planning problems. Several approaches have been proposed to optimize a policy that decomposes according to a hierarchy specified a priori. In this paper, we investigate the problem of automatically disc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016